An Adaptive Approach: Text Line Extraction from Multi-Skewed Hand Written Documents
Abstract
Advancing technology has made document image processing an important part of automating office documentation. Digital filing systems save space, paper, and printing costs. A problem arises when the document to be read is not placed correctly in the scanner, which leads to misinterpretation of the document and increases the storage space required. This paper deals with the extraction of text from such skewed documents and the proper alignment of that text.
Similar resources
A New Text-Mining Method for Extracting User Context Information to Improve the Ranking of Search Engine Results
Today, the importance of text processing and its uses is well known among researchers and students. The amount of textual and documentary material increases day by day, so we need useful ways to store it and retrieve information from it. For example, search engines such as Google, Yahoo, and Bing need to read a great many web documents and retrieve the most similar ones to the user ...
Using Scale-Space Anisotropic Smoothing for Text Line Extraction in Historical Documents
This paper presents a novel approach for text line extraction which is based on Gaussian scale space, a dedicated binarization, and an energy minimization framework. It enhances the text lines in the image using multi-scale anisotropic second derivative of Gaussian filter bank at the average height of the text line. It then applies a binarization, which is based on component-tree and is tailore...
A Model for Extracting Information from Text Documents, Based on Text Mining in the E-Learning Domain
As computer networks become the backbones of science and the economy, enormous quantities of documents become available. Text mining techniques have therefore been used to extract useful information from textual data. Text mining has become an important research area that discovers unknown information, facts, or new hypotheses by automatically extracting information from different written documents. T...
Threshold Approach to Handwriting Extraction in Degraded Historical Document Images
Handwriting extraction is the ability of a system to receive and interpret intelligible handwritten input from sources such as documents, photos, touch screens, and other devices. The image of the written document is used to detect written text by optical scanning, known as optical character recognition. Handwriting extraction basically uses optical character recognition. Conversely, ...
Performance of Content Based Mining Approach for Multi-lingual Textual Data
Data mining has become a necessary and powerful tool in the present era of web and internet communications. It has also evolved into media mining, in which heterogeneous inputs such as figures, video, and audio are gradually being embedded into the web, making it more complex and varied. These and other aspects like currency and ‘liveliness’ of the web bring in more interesting fe...